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Abstract — his work proposes a blind adaptive reduced-rank 
scheme and constrained constant-modulus (CCM) adaptive algo- 
rithms for interference suppression in wireless communications 
systems. The proposed scheme and algorithms are based on a 
two-stage processing framework that consists of a transformation 
matrix that performs dimensionality reduction followed by a 
reduced-rank estimator. The complex structure of the transfor- 
mation matrix of existing methods motivates the development 
of a blind adaptive reduced-rank constrained (BARC) scheme 
along with a low-complexity reduced-rank decomposition. The 
proposed BARC scheme and a reduced-rank decomposition based 
on the concept of joint interpolation, switched decimation and 
reduced-rank estimation subject to a set of constraints are 
then detailed. The proposed set of constraints ensures that the 
multi-path components of the channel are combined prior to 
dimensionality reduction. In order to cost-effectively design the 
BARC scheme, we develop low-complexity decimation techniques, 
stochastic gradient and recursive least squares reduced-rank 
estimation algorithms. A model-order selection algorithm for 
adjusting the length of the estimators is devised along with 
techniques for determining the required number of switching 
branches to attain a predefined performance. An analysis of the 
convergence properties and issues of the proposed optimization 
and algorithms is carried out, and the key features of the 
optimization problem are discussed. We consider the application 
of the proposed algorithms to interference suppression in DS- 
CDMA systems. The results show that the proposed algorithms 
outperform the best known reduced-rank schemes, while requir- 
ing lower complexity. 
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The proposed set of constraints ensures that the multi-path 
components of the channel are combined prior to dimensionality 
reduction. In order to cost-effectively design the BARC scheme, 
we develop low-complexity decimation techniques, stochastic 
gradient and recursive least squares reduced-rank estimation 
algorithms. A model-order selection algorithm for adjusting 
the length of the estimators is devised along with techniques 
for determining the required number of switching branches to 
attain a predefined performance. An analysis of the convergence 
properties and issues of the proposed optimization and algorithms 
is carried out, and the key features of the optimization problem 
are discussed. We consider the application of the proposed 
algorithms to interference suppression in DS-CDMA systems. The 
results show that the proposed algorithms outperform the best 
known reduced-rank schemes, while requiring lower complexity. 
T 

Index Terms — nterference suppression, blind adaptive estima- 
tion, reduced-rank techniques, iterative methods, spread spec- 
trum systems.nterference suppression, blind adaptive estimation, 
reduced-rank techniques, iterative methods, spread spectrum 
systems.I 

I. Introduction 

Interference suppression in wireless communications has 
attracted a great deal of attention in the last decades [fl~), 
Motivated by the need to counteract the effects of wireless 
channels, to increase the capacity of multiple access schemes, 
and to enhance the quality of wireless links, a plethora of 
schemes and algorithms have been proposed for equalization, 
multiuser detection and beamforming. These techniques have 
been applied to a variety of standards that include spread spec- 
trum [3 1, orthogonal frequency-division multiplexing (OFDM) 
|0) and multi-input multi-output (MIMO) systems (5) and 
continue to play a key role in the design of wireless com- 
munications systems. 

A. Prior Work 

In order to design interference mitigation techniques, de- 
signers are required to employ estimation algorithms for 
computing the parameters of the filters used at the receiver 
or at the transmitter. In the literature of estimation algo- 
rithms, one can broadly divide them into supervised and blind 
techniques. Blind methods are appealing because they can 
alleviate the need for training sequences or pilots, thereby 
increasing the throughput and efficiency of wireless networks. 
In particular, blind estimation algorithms based on constrained 
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optimization techniques are important in several areas of signal 
processing and communications such as beamforming and 
interference suppression J6]. The constrained optimizations 
in these applications usually deal with linear constraints that 
correspond to prior knowledge of certain parameters such 
as direction of arrival (Do A) of users' signals in antenna- 
array processing J7] and the signature sequence of the de- 
sired signal in CDMA interference suppression [8 |. Numerous 
blind estimation algorithms with different trade-offs between 
performance and complexity have been reported in the last 
decades ll8l- lfT6l . The designs based on the constrained con- 
stant modulus (CCM) criterion EU, El, US, E), ffiO have 
shown increased robustness against signature mismatch and 
improved performance over constrained minimum variance 
(CMV) approaches |8), 0, ifTUl . In general, the convergence 
and tracking performances of these algorithms depend on the 
eigenvalue spread of the M x M full-rank covariance matrix 
R of the input data vector r[i] that contains M samples of 
the signal to be processed, and the number of elements M in 
the estimator |6j. When M is large, blind algorithms require 
a large number of samples to reach their steady-state behavior 
and may encounter problems in tracking the desired signal. 

Reduced-rank signal processing is a key technique in low- 
sample support situations and large optimization problems that 
has gained considerable attention in the last few years lfT7l - 
OTl . The fundamental idea is to devise a transformation in 
such a way that the data vector r[i] can be represented by a 
reduced number of effective features and yet retain most of its 
intrinsic information content 0171 , The goal is to find the best 
tradeoff between model bias and variance in a cost-effective 
way. Prior work on reduced-rank parameter estimation has 
considered eigen-decomposition techniques |[T8l . the multi- 
stage Wiener filter (MSWF) 0H, 03 that is a Krylov sub- 
space method, the auxiliary vector filtering (AVF) algorithm 
ED, ED, ED, ED, the joint and iterative optimization (JIO) 
strategy ED, ED, ED and adaptive interpolated filters EH, 
E3, ED- A major problem with the MSWF, the AVF-based 
and the JIO schemes is their high complexity. Prior work on 
adaptive interpolated filters ll24l . EH, ll26l has considered 
MMSE- and CMV-based designs and shown a significant 
performance degradation for rank reduction with large com- 
pression ratios. This problem has been recently addressed by 
the joint interpolation, decimation and filtering (JIDF) scheme 
E3, ll30l for supervised training. With the exception of the 
CCM-based MSWF of H3 and the JIO of ED, there is no 
blind reduced-rank that has low complexity, good performance 
and robustness against signature mismatches. 

B. Contributions of This Work 

In this work, we present a low-complexity blind adaptive 
reduced-rank constrained scheme (BARC) based on the CCM 
criterion and a reduced-rank decomposition using joint inter- 
polation, switched decimation and reduced-rank estimation. 
The proposed scheme is simple, flexible, and provides a 
substantial performance advantage over prior art. Unlike the 
JIDF scheme 1301 . the BARC uses an iterative procedure 
in which the interpolation, decimation and estimation tasks 



are jointly optimized using the CCM design criterion. In 
the BARC system, the number of elements for estimation is 
substantially reduced in comparison with existing full-rank and 
reduced-rank schemes, resulting in considerable computational 
savings and improved convergence and tracking performances. 
A unique feature of the BARC and the proposed algorithms is 
that, unlike existing blind schemes, they do not rely on the 
full-rank covariance matrix R for performing dimensional- 
ity reduction. The BARC and proposed algorithms skip the 
processing stage with R and directly obtain the subspace 
of interest and constraints via a set of simple interpolation, 
decimation and reduced-rank estimation operations, which 
leads to much faster convergence and improved performance. 
We develop low-complexity decimation techniques, stochastic 
gradient (SG) and recursive least squares (RLS) reduced-rank 
estimation algorithms. Differently from 11301 . these algorithms 
are designed with a set of constraints that are alternated in 
the optimization procedure. A model-order selection algorithm 
for adjusting the length of the filters is devised along with 
techniques for determining the required number of switching 
branches to attain a predefined performance. The proposed 
model-order selection differs from 11301 as it employs an 
extended filter approach, which is significantly simpler than 
the scheme in ll30l that uses multiple schemes in parallel. The 
algorithms for adjusting the number of branches are based 
on the constant modulus criterion as opposed to the mean- 
squared error (MSE) criterion employed in [13(71 . An analysis 
of the convergence properties and aspects of the proposed 
optimization and algorithms is also presented. We apply the 
proposed BARC and algorithms to interference suppression in 
DS-CDMA systems. 

This paper is organized as follows. The system model of a 
DS-CDMA system and the problem statement are presented 
in Section II. Section III is dedicated to the description of the 
BARC scheme and the CCM reduced-rank estimators. Section 
IV is devoted to the presentation of the blind adaptive SG 
and RLS estimation algorithms, adjustment of model-order 
selection and the number of switching branches, and their 
complexity. Section V provides an analysis and a discussion of 
the proposed optimization problem. Section VI presents and 
discusses the simulation results and Section VII draws the 
conclusions. 



II. System Model and Problem Statement 

Let us consider the uplink of a symbol synchronous DS- 
CDMA system with K users, N chips per symbol and L p 
is the maximum number of propagation paths in chips. A 
synchronous model is assumed for simplicity since it captures 
most of the features of asynchronous models with small to 
moderate delay spreads. The modulation is assumed to have 
constant modulus. Let us assume that the signal has been 
demodulated at the base station, the channel is constant during 
each symbol and the receiver is perfectly synchronized with 
the main channel path. The received signal after filtering by 
a chip-pulse matched filter and sampled at chip rate yields an 
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M-dimensional received vector at time i 

K 

r[i] = A k [i]b k \i]C k h k \i] + ri k \i] + n[i] 

k=l 



(1) 



where M = N + L p — 1, n[i] = [n>i[i] ... riM[i]] T 
is the complex Gaussian noise vector with zero mean and 
Bfn^n^ji]] = a 2 1 whose components are independent and 
identically distributed, where (.) T and (.) H denote transpose 
and Hermitian transpose, respectively, and E[] stands for 
expected value. The user symbols are denoted by b k [i], the 
amplitude of user k is the first term in (Q3 represents) 

the user signals transmitted over multipath channels including 
the inter-chip interference (ICI), and rj k [i] is the inter-symbol 
interference (ISI) for user k from the adjacent symbols. The ( 
signature of user k is represented by s k = [a k (l) . . . a k (N)] T , 
the M x L p constraint matrix C k that contains one-chip shifted 
versions of the signature sequence for user k and the L p x 1 
vector hk [i] with the multipath components are described by 



the convergence and tracking performance and reduce the 
complexity. This is performed via the reduction of the number 
of coefficients for computation from M (full-rank schemes) or 
D + MD (existing blind reduced-rank schemes) to less than 
a dozen. The structure of the BARC scheme is shown in Fig. 
Q] where an interpolator, a decimator with several switching 
decimation branches and a reduced-rank estimator which are 
time-varying are employed. 



C k = 



Ofc(l) 

a k (N) 






o fc (l) 
a k (N) 



, h k [i] = 



hk,L p -i[i] 



(2) 

The multiple access interference (MAI) comes from the 
non-orthogonality between the received signature sequences, 
whereas the ISI span L s depends on the length of the channel 
response and how it is related to the length of the chip 
sequence. For L p = 1, L s = 1 (no ISI), for 1 < L p < 
N, L s = 2, for N < L p < 2N, L s = 3, and so on. This 
means that at time instant i we will have ISI coming not 
only from the previous L s — 1 time instants but also from 
the next L s — 1 symbols. The linear model in ([U can be used 
to represent other wireless communications systems including 
MIMO and OFDM systems. For example, the user signatures 
of a DS-CDMA system are equivalent to the spatial signatures 
of MIMO system. 

A reduced-rank interference suppression scheme processes 
the received vector r[i] in two stages. The first stage performs 
a dimensionality reduction via a decomposition of r[i] into a 
lower dimensional subspace. The second stage is carried out 
by a reduced-rank estimator. The output of a reduced-rank 
scheme corresponding to the ith time instant is 
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w H \i]r\i] 



(3) 



where 
which 



Sd[i\ is an M x D decomposition matrix 
performs dimensionality reduction and 
w[i) — [w^ ...w^] T is the D x 1 parameter 

vector of the reduced-rank estimator. The basic problem is 
how to cost-effectively and blindly design the M x D matrix 
So[i] mat transforms the M x 1 vector r[i] into a D x 1 
reduced-rank vector r[i] using the CM criterion . 

III. Proposed BARC Scheme 

In this section we introduce the proposed BARC scheme 
and detail its key features. The motivation is to improve 
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Fig. 1. Proposed blind adaptive reduced-rank estimation structure. 

The M X 1 received vector r [i] is filtered by the interpolator 
with / being the length of the interpola- 



te i 



J i-il 



tor and yields the interpolated vector rj[i] = V H [i]r[i], where 
the M x M convolution matrix V[i] which has shifted copies 
of v[i] as described by 



V\i] = 



,['1 



J i-i 











J I-1 







(4) 



Let us now express the M x 1 vector ri[i] in a way that 
is suitable for algebraic manipulation as a function of the 
interpolator v[i\. 

r I [i\ = V H \i]r[i] = U {i]v*[il (5) 

where the M x / Hankel matrix 1321 with the received samples 
of r[i] performs the convolution and is described by 



»o[i] 
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' T 



r A/-2 r M-l 



' M-l 







(6) 



The M x 1 vector rj[i] is transformed by a decimation 
unit that contains B switching decimation patterns in parallel, 
leading to B different fl x 1 vectors fb[i], b = 1,...,B, 
where L is the decimation factor and D = M/L is the rank 
of the BARC system. This is inspired by diversity techniques 
found in wireless communications [35), whose principle is 
to collect different copies of signals and combine them to 
increase the signal-to-noise ratio, and switched control systems 
l36l that exploit switching rules to stabilize and design a 
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system. The decimation procedure corresponds to discarding 
M — D samples of r\ [i] with different patterns, resulting in B 
different D xl decimated vectors f b [i] . The D x 1 decimated 
vector for branch b is given by 



f b \i] = D b [i\n]i], b=i,...,B 



(7) 



where each row of Db[i] contains a single 1 and M — 1 zeros. 
The DxM decimation matrix D b [i] is equivalent to removing 
M — D samples of r\ [i] . The matrices D b [i] are designed off- 
line, stored at the receiver and the best Db[i) is selected to 
minimize a desired objective function. The output Zb[i] of the 
BARC scheme corresponds to filtering r&[i] with w[i] and then 
selecting the branch that minimizes the desired criterion. The 
output Zb[i] is a function of w[i], Db[i] and v[i] expressed by 

*b[i] = *> B \i]SZ, b mi\ = w H [i\(D b [i]V B [i]r[i]) 

= w H [i}(Db\i}$to\i])v*[i] = w H [i\R b \i]v*[i] (8) 



= v 



H i 



(§#[*]«*[*]) =v H \i]u[i\ 



where u[i] = 3l b [i]w*[i\ is an / x 1 vector, the D coefficients 
of w[i] and the / elements of v\i\ are assumed complex and 
the D x I matrix dt b [i] is dt b [i] = D b [i]dt a [i]. In what follows, 
we will develop constrained constant modulus (CCM)-based 
estimators and describe how the switching rule is incorporated 
into the proposed blind design. 

A. Joint Iterative CCM Design of Estimators and Discrete 
Optimization 

The design of the BARC scheme is equivalent to solving a 
joint optimization problem with v[i], Db[i] and w[i] using 
a strategy based on fixing two parameters and optimizing 
one, and alternating the procedure among the parameters until 
convergence. A key feature of this problem is that it involves 
a combination of continuous and discrete optimization proce- 
dures. Specifically, the design corresponds to the constrained 
continuous minimization of the estimators v\i\ and w[i] and 
the discrete minimization of D[i] according to the CCM 
design criterion. 

Let us describe the CCM estimators design of the BARC 
structure. The CCM design for v[i], Db[i] and w[i] can be 
computed through the optimization problem 



opt 1 



} = arg min J C m («[»], A> [i] , w [i]) , 

v[i\,D b [i\,w[i\ 



subject to Wj? [i]Sp[i]p[. 



(9) 



where the parameter v is a constant employed to enforce 
convexity and 

.2" 



J CM (v [i] , D b [»] , w [i] ) = E ( | w H \i}M\i}v* [i] | 



1 



(10) 



The decimation matrix D b [i] is selected to minimize the 
square of the instantaneous constant modulus error obtained 
for all the B branches according to 



D b [i] = D bB [{\ when b s = arg min (e 6 [i]) 2 

Kb<B 



(11) 



where the constant modulus error signal of the BARC scheme 
is e b [i] = \w H [i]Sp b [i]r[i]\ 2 - 1. With the selected deci- 
mation matrix Db[i], we can form the reduced-rank vector 



f[i] = Db[i] V H [i]r[i] that will be used in the following 
procedure for the design of v[i] and w[i\. 

By using the method of Lagrange multipliers, fixing w[i] 
and minimizing the Lagrangian with respect to v[i], the 
expression for the interpolator becomes 



v[i + 1] = R^li] [d u \i] - {pZ [i]^ 1 ^^])" 1 



(12) 



where R u [i] = E[\z[i]\ 2 u[i]u H d u [i] = E[z*[i]u[i]], 
u[i] — ?R^[i]w*[i] and p w [i] = P^[i]w[i]. The D x I matrix 
P [i] = D[i]dtp[i] arises from the constraint and the equiva- 
lence wg \i]S%\i]p\i] = w^\i]P T \i\v*\i] = «f \%]p w [i\ = v, 
where SRp[i] is a D x M Hankel matrix with elements of 
the effective signature p[i] shifted in a similar way to ©. 
By fixing the interpolator v [i] and minimizing the Lagrangian 
with respect to w[i], we obtain 

w[i + 1] = i?; 1 ^] [d z [{\ - (p H [i\R- 1 [i\p[i\r 1 



■ p\i}(p H [{]R; Wm-v)], 



(13) 



where R z [i] = E[\z\i]\ 2 f\i]f H \i}} = S%[i]R z \i]S D \i] 
, R z [{\ = E[\z[i\\ 2 r[i\r H [{\] , d,[*] = E[z*\i)r\i}} = 
S%[i]E[z'[i]r[t\, p[i] = S%\i]p\i] and S D [i] - D b \i]V H \i\. 
We remark that dTTt . (fTzl i and (fTjt depend on each other and 
their previous values. Therefore, it is necessary to iterate (fTTI) . 
( fT2b and ( TT3b in an alternated form (one followed by the other) 
with an initial value to obtain a solution. The expectations 
can be estimated either via time averages or by instantaneous 
estimates as will be described by the adaptive algorithms. 

B. Design of Decimation Schemes 

We are interested in developing decimation schemes that 
are cost-effective and easy to employ with the proposed 
BARC scheme. This can be done by imposing constraints 
on the structure of -D&[i]. Since the operator Db[i] performs 
decimation, the structure of Db[i] is constrained to contain 
only zeros and D ones. Thus, the decimation operation of the 
BARC scheme amounts to discarding samples in conjunction 
with filtering by v[i] and w[i\. The decimation matrix Db[i] 
is selected so to minimize the square of the instantaneous 
constant modulus error obtained for the B branches employed 
as follows 



D 



b\i\ = D bs [i 



when 6, 



arg min (e 6 [z]) 2 

Kb<B 



(14) 



where e&[i] = \w H [i]Sp fc [i]r[i]| 2 — 1. The design of the 
decimation matrix Db[i] considers a general framework that 
can be used for any decimation scheme and is illustrated by 



D 



dlb\i] 



d 



D,bl 



(15) 



where each row of the matrix Db[i] is structured as 



dj,b\i] = [0 



1 











(16) 



(M— 7j— 1) zeros 
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and the index j (j = 1,2, ...,D) denotes the j'-th row of 
the matrix, the rank of the matrix Db[i] is D = M/L, the 
decimation factor is L and B corresponds to the number of 
parallel branches. The quantity jj is the number of zeros 
chosen according to a given design criterion. 

Given the constrained structure of D&[i], it is possible to 
devise an optimal procedure for designing Dt,[i] via an exhaus- 
tive search of all possible design patterns with the adjustment 
of the variable 7^, where an exhaustive procedure that selects 
D samples out of M possible candidates is performed. The 
total number of patterns i? ox is equal to 



B cx = M • (M - 1) . . . (M — D + 1) = 



M 
D 



D terms 



We can view this exhaustive procedure as a combinatorial 
problem that has M samples as possible candidates for the first 
row of £>b [i] and considers M — j + 1 positions as candidates 
for the following D — 1 rows of the matrix Db[i], where j 
is the index used to denote jth row of the matrix Z3&[i]. The 
exhaustive scheme described above is, however, too complex 
for practical use because it requires D permutations of M 
samples for each symbol interval and M — 1 candidates for the 
positions, and carries out an extensive search over all possible 
patterns. 

It is highly desirable to employ decimation schemes that 
are cost-effective and gather important properties such as low- 
requirements of storage and computational complexity and can 
work with a small number of branches B. By adjusting the 
variable jj in the framework depicted in (fT5t . we can obtain 
the following sub-optimal schemes: 

A. Uniform (U) Decimation with B = 1. We make 7, — 
(j — 1)L and this corresponds to the use of a single 
branch (B = 1) on the decimation unit (no switching 
and optimization of branches), and is equivalent to 
the scheme in |26l . 

B. Pre-Stored (PS) Decimation. We select jj = (j — 
l)L + (b — 1) which corresponds to the utilization 
of uniform decimation for each branch b out of B 
branches and the different patterns are obtained by 
picking out adjacent samples with respect to the 
previous and succeeding decimation patterns. 

C. Random (R) Decimation. We choose jj as a discrete 
uniform random variable, which is independent for 
each row j out of B branches and whose values range 
between and M — 1. A constraint is included to 
avoid rows with repetitive patterns. 

IV. Blind Adaptive Estimation Algorithms 

In this section, we develop SG and RLS estimation algo- 
rithms J6| for estimating the parameters of the BARC scheme 
(t>[i],.D[i] and w[i]). The SG algorithms require the setting of 
step sizes and are indicated for situations where the eigenvalue 
spread of R z [i] is small. The RLS algorithms need the 
setting of forgetting factors and are suitable for scenarios 
in which R z [i] has a large eigenvalue spread. We also 
present blind model-order selection algorithms for adjusting 
the lengths D and / of the estimators and algorithms for 



determining the minimum number of branches required to 
achieve a predetermined performance. The model-order and 
number of branches selection algorithms are decoupled in 
order to reduce the search space and the computational cost. 
We have tested a joint search over /, D and B and this has 
not resulted in performance gains over the separate search over 
B and over / and D. Unlike prior work |f30l with the MSE 
criterion, the proposed algorithms employ the CM approach 
and rely on a set of linear constraints. The complexity of the 
proposed SG, RLS and model-order selection algorithms is 
compared with existing methods in terms of additions and 
multiplications. 

A. SG Algorithms for The BARC Scheme 

To design the estimators v[i] and w[i] and the decimation 
matrix D[i], we consider the Lagrangian 



£(v\i],D[i\,w\i}) = E\(\w H [i]?fi b [i]v*[i]\ 2 - I) 



25R 



{w H \i]S*\i}p\i}-v)\ 



(17) 



where A is a Lagrange multiplier and 5ft[-] denotes the real 
part of the argument. The input vector r [i] is processed by the 
interpolator v[i], yielding r\[i] = V H [i]r[i\. We then compute 
the decimated interpolated vectors ri\i\ for the B branches 
with the decimation matrix D&[i], where 1 < b < B. Once the 
B candidate vectors rb[i] are computed, we select the vector 
fb\i] which minimizes the square of 

e b [i]^\w H [i]S D ^r[i]\ 2 -1. (18) 

where S 1 £>,&[*] = V[i]D^ [i]. Based on the selection of -D&[i], 
we choose the corresponding reduced-rank vector f[i] and 
select the error of the proposed SG algorithm e[i] as the error 
eb[i] with the smallest squared magnitude of the B branches 
according to 



S D \i] 



when 



r[i] = rb s [i] and e[i] = eb s [i] 



(19) 



arg min (e b [i]Y 

Kb<B 



In order to derive an SG algorithm for v[i], we need to 
transform the proposed constraint in (|9) and obtain a suit- 
able and equivalent form for use with v[i]. We can write 
w H \i}S%\i}p\i] = w H \i}pT\i}v*\i] 



v. 



where p w [i] = P^[i]w[i] and the JJ x I matrix P [i\ is a 
function of Db[i] and p[i] and is given by P Q [i] = D[i]^- P [i], 
where 5R p [z] is a D x M Hankel matrix with elements of the 
effective signature p[i] shifted in a similar way to ©. We 
need to construct p w [i] for each symbol from P [i] and w[i]. 
Minimizing dTD i and using the proposed equivalent constraint 
v H [i]p w [i] — v, we obtain 

v[i+l] = \I-(Pw [i]P«,W) _1 P«,[^[<]V[«] 



(20) 

where \i v is the step size. Minimizing ( fTTI i and using the 
constraint w H [i]S^)[i\p[{\ = v, we obtain 

w[i + 1] = w[i] - n w e[i\z*[i} (i - {p H [i]pm~ 1 P[^P H 

(21) 
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where p, w is the step size. The SG algorithm for the BARC has 
a computational complexity 0(D+Ni) and employs equations 
CO^-tED- In fact, the BARC scheme trades off one SG 
algorithm with complexity 0(M) against two SG algorithms 
with complexity 0(D) and 0(1), operating simultaneously and 
exchanging information. 

B. RLS Algorithms for the BARC Scheme 

In order to design the estimators v[i], w[i] and the matrix 
D[i) with RLS algorithms, we consider the Lagrangian 



oT x R z [i - l]z[i]f\i] 



1 + a- l f H [i\z[i}R z [i - l]z*[i]f[i] 



(30) 



R z [i] = a~ 1 R x [i- 1] - or*k z [i\z*[i]f H [i}R z 

(31) 

and the initial values of the recursions are R z [i] = S W I 
and d z [Q] = p w , where 8 W and p w are small positive scalars. 
The RLS algorithm for the BARC has a computational cost 
of 0(D 2 ) + 0(I 2 ) and consists of equations (CDTl-CTi. 



£ls («[»], D[i\, w[i\) = al ~ l (\™ H lW]v* [i]| 2 



1 



25R [(w H \i]S ] [ S\i]p\i]-v)\ 



(22) 

where A is a Lagrange multiplier and a is a forgetting factor. 
We perform the signal processing according to the block 
diagram of Fig. 1. Based on the choice of Db[i], we select 
the corresponding reduced-rank vector f[i] and the error e[i] 
as the error e&[i] = \w H [i]Sj) t b[i]r[i]\ 2 — 1 with the smallest 
squared magnitude of the B branches as follows 

S D [i] = S D ,b a , r[i] = f b3 [i] and e[i] = e bs [i] 

when ^23) 
b s = arg min {e b [i]) 2 ■ 

l<b<B 

Minimizing ( 1221 with respect to v[i], using the constraint 
u ff [i]p w [«] = v and the matrix inversion lemma J6], we get 



v [i + 1] = R u 1 [i] ( d u [i] + (p^ [i] R„ 1 [i]p„ [i] ) ~ ] 



P w [i]{ d w[i]Ru [i]Pw\i] - v) 



where 



k u [i] 



d u [i] = ad u [i — 1] + (1 — a)z*[i]tt[i] 
a~ 1 R u [i — 



1 + a- 1 u H [i]z[i]R v 



V\z*\i\u\i\ 



(24) 

(25) 
(26) 



R u 1 [i] = a" 1 ^ [t - 1] - a _1 fe M [i]2:*[i]w H [i]-R u [i - 1] 

. -i (27) 

and the initial values of the recursions are R u [i] = S V I 
and d u [0] = p v , where 5 V and p v are small positive scalars. 
Minimizing d22"l i with respect to w[i], using the constraint 
™k i^^D = v anc l me me ma trix inversion lemma J6], 

we obtain 



w[i + 1] = [d z [i] + (p H [i]R z \i\pii\y 1 



where 



p[i\(p H [i\R z \i]d z [i\- 



d z [i] = ad z [i — 1] + (1 — a)2*[i]r[i] 



(28) 



(29) 



C. Model-Order Selection Algorithms 

This part develops model-order selection algorithms for 
automatically adjusting the lengths of the estimators used in 
the BARC scheme. Prior work in this area has focused on 
methods for model-order selection which utilize MSWF-based 
algorithms Q9) or AVF-based recursions ED, EH, EH. In 
the proposed approach we constrain the search within a range 
of appropriate values and rely on a CCM-based LS criterion to 
determine the lengths of v[i] and w[i] that can be adjusted in 
a flexible structure. The proposed scheme with extended filters 
is significantly less complex than the multiple filters approach 
reported in ||30l . The model-order selection algorithm for the 
BARC is called Auto-Rank and minimizes 

i 

C(v[i],D[i],w[{\) - al ~ l (\w H [i\D[i]U [l]v*[i\\ 2 - l) , 



i=i 



(32) 

The order of v[i], D[i], w[i], and the associated matrices 
flu[*]> an d tlz[i] defined in d27l i and OTT i. respectively, that 
are necessary for the computation of v[i] and w[i] require 
adjustment. To this end, we predefine v[i] and w\i\ as follows: 



v[t\ = i vi[i\ V2[i\ ... vi min m ■■■ «/ max N J 
w[i] = \ wi[i] w 2 [i] ... w D .[i] ... w Da H] 



(33) 



For each data symbol we select the best order for the 
model. The proposed Auto-Rank algorithm that chooses the 
best lengths D op t[«] and / op t[i] for the filters v\i\ and w[i], 
respectively, is given by 

{D opt [i] , / op t [i] } = arg min C {v [i] ,D[i],w [i] ) (34) 
where d and n are integers, D m i n and D max , and 7 m i n 



and Ir, 



are the minimum and maximum ranks allowed 



for the reduced-rank filter and the interpolator, respectively. 
The additional complexity of the Auto-Rank algorithm is 
that it requires the update of all involved quantities with the 
maximum allowed rank Z? max and I max and the computation 
of the cost function in d32l . This procedure can significantly 
improve the convergence performance and can be relaxed 
(the rank can be made fixed) once the algorithm reaches 
steady state. An inadequate rank for adaptation may lead to 
a performance degradation, which gradually increases as the 
adaptation rank deviates from the optimal rank. 
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D. Automatic Selection of the Number of Branches 

In this subsection we propose algorithms for automatically 
selecting the number of branches necessary to achieve a 
predetermined performance. This performance measure is de- 
termined off-line as a quantity related to the constant modulus 
cost function. The first algorithm, termed selection of the 
number of branches (SNB), relies on a simple search over the 
parallel branches of the BARC scheme and tests whether the 
predetermined performance has been attained via a comparison 
with a threshold p. The second algorithm builds on the SNB 
algorithm and incorporates prior statistical knowledge about 
the use of the branches via sorting and is denoted SNB-S. Let 
us first define for each time interval i the branch cost as 



Cbranch(«[«], D&[i], w[i]) = {e b [i]f 



(35) 



where 



e& \i\ 



is the error signal for each branch. The proposed algorithms 
for automatically selecting the number of branches perform 
the following optimization 

Bjil=argmin _min C hYgbach (v[i], T> b [i],w[i]) 

i? max 



mm 

K6<B m 



Subject tO Cbranch (v[i], D 6 [«], w[i]) < p 

(36) 

where b is an integer and £? max is the maximum number of 
branches allowed for the BARC scheme, respectively, B s is the 
number of branches required to attain the desired performance 
and p is the prespecified performance. The SNB algorithm 
determines the minimum number of branches necessary to 
achieve a predetermined performance e according to the cost 
function defined in (l35l ). It iteratively increases the number 
of branches by one until the predetermined performance p is 
attained. The parameter p can be chosen as a function of the 
MMSE with a penalty allowed by the designer. An alternative 
to the SNB algorithm is to exploit prior statistical knowledge 
about the most frequently used branches and sort the deci- 
mation matrices D b [i] in descending order of probability of 
occurrence. The SNB algorithm with sorting will be termed 
SNB-S and consists of ordering the matrices D b [i] which are 
most likely to be used. This can be done at the beginning of 
the transmission and updated whenever required. An important 
measure that arises from the SNB and SNB-S algorithms is 
the average number of branches i? avg = 1/Q 5^2= i B s [i] 
with Q being the data record, which illustrates the savings 
in computations of the branches. 

E. Computational Complexity 

In this section we detail the computational complexity of 
the proposed and existing SG, RLS and model-order selection 
algorithms, as shown in Tables I, II and III. This complexity 
refers to an adaptive linear receiver that only requires the 
timing and the spreading code of the user of interest. The 
computational requirements are described in terms of addi- 
tions and multiplications and have been derived by counting 
the necessary operations to compute each of the recursions 
required by the analyzed algorithms. The key parameters of 



the complexity are the length D of w[i] or the number of 
auxiliary vectors (AVs) for the AVF algorithm ED], (ED, ll22l . 
the number of samples M of r[i], the number of branches B, 
the length / of v\i\ and the number L p of assumed multipath 
components. 

TABLE I 

Computational complexity of SG algorithms. 

Number of operations per symbol 



Algorithm 


Additions 


Multiplications 


Full-rank-trained |6| 


Ul 


2M + 1 


(eq. (9.5)-(9.7) of M) 


2M 


2M + 1 


MSWF-trained COD 


2(D - l) 2 + 2D(M - 1) 


D 2 +3D + 2DM 


(eq. (53)-(62) of SM) 


+(D - 1)(M - 1) + M 


+M + 1 


Full-rank-CCM Q2L 


8M + ML P 


7M + ML P 


(eq. (10),(11),(13) afWl) 


+2 


+2 


MSVV1-CCM (T4l 


DM 2 + 3(D - if + 2D 


DM 2 + 2D 2 + 7D 


(table II of HW 


+3DM + 4M + 3 


+2DM + ML P + 2 


JIO-CCM [?] 


4DM + M 


4DM + M 


(eq. (14)-(15) of [ID 


+2D - 2 


+7D + & 


Proposed BARC-CCM 


4D + BD + 41 


4D + B(D + 1) 


(eq. (18)-(21) 


+(I - 1)M - 2 


+51 + IM + 4 



TABLE II 

Computational complexity of RLS and AVF-based 

ALGORITHMS. 



Algorithm 



Number of operations per symbol 



Additions 



Multiplications 



Full-rank-trained |6| 


3(M- If 
+M 2 + 2M 


3M 2 


(table 13.1 of M) 


+2M + 2 


MSWF-trained COD 


D 2 +2(D-lf 


4D 2 + 3D 


(eq. (69)-(71) ofgJSH) 


+2D(M - 1) + M 


+2DM + MLp 




+(Z>-1)(M-1) 

5(M - l) 2 
+M 2 +5M-1 


+4 


Full-rank-CCM Q3) 


4M 2 + 5M 


(eq. (6)-(10) cftHj) 


+L 2 p + MLp 


MSWF-CCM UD 


+3(L P - l) 2 
DM 2 + 5D 2 


+L p + 4 


DM 2 + 6D 2 


(table III of SIB) 


+2DM + (M — l)L p 


+2DM + (D — 1)M 




+(D-1)(M + 1) 
5M 2 + DM 


+7D + MLp + 4 
6M 2 + (2D + 6)M 
+5D 2 +9D + 3 


JIO-CCM [?] 


(eq. (10)-(11) off!]) 


+5D 2 + 3D - 1 


AVF-trained (20l 


L>(3M 2 - 2M) 


D(4M 2 + 3M) 


(eq. (3),(U)-(13)rfWl 


+2M - 1 


+4M + 2 


Proposed BARC-CCM 


6D 2 + (B + l)D + 6/ 2 


7D 2 + (B + 8)D + 71 


(eq. (23)-(31)) 


+1 + (I - 1)M + 8 


+71 + IM + 3 



In Fig. 2 we illustrate the main complexity trends by show- 
ing the computational complexity in terms of the arithmetic 
operations as a function of the number of samples M. We use 
the same colors for the corresponding SG techniques in Fig. 
2 (a) and the RLS counterparts in Fig. 2 (b). For these curves, 
we consider L p = 9, D = 5, / = 3 and B = 8 for the BARC, 
assume D = 4 for the MSWF-SG based approaches, while 
we use D = 5 for the MSWF-RLS techniques and D = 8 
for the AVF technique with non-orthogonal auxiliary vectors 
(AVs) 1201 . lETl . lF2Tl . The reason why we use different values 
for D is because we must find the most appropriate trade- 
off between the model bias and variance ifTTl by adjusting 
D (AVs for the AVF) and this depends on the scheme. We 
always use the best values for each scheme. The curves in 
Fig. 2 (a) show that the reduced-rank BARC SG algorithms 
have a complexity slightly higher than the full-rank trained 
SG algorithms and substantially lower than the other analyzed 
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table iii 

Computational complexity of model-order selection 
algorithms. 



Algorithm 
Auto-Rank 
(Extended Filters) 

Projection with 
Stopping Rule QU 

CV (20) 



Multiple Filters (30) 
(JIDF or BARC) 



2(A 
2(7, 



Additions 

lax D- 



min J ~f~ 1 

+ l 



2(2M - l)x 

(D ill a x 

- D 

m in 



(2(D„ 



(2M- l)x 

Drain) 



+ 1) 
+ l 



2(D m 

fa ^-D-max ; ^-max ) ~f~ ■ ■ ■ 
H~ fa [Djjiin , Imin') 

-D 

mm 

) + 1 



Imin) additions, as depicted in the first row of Table III, in 
addition to the operations required by the proposed algorithms, 

whose complexity is shown in the last rows of Tables I and 

u rjMca log ^ operation of the MSWF and the AVF algorithms 
with model-order selection algorithms, a designer must add 
the complexities in Tables I and II to the complexity of the 
~*~ i) ^th^pl-order selection algorithm of interest, as shown in Table 
mm III. The model-order selection algorithm with multiple filters 
(Dmax — D m i n fcaJ) h number of arithmetic operations that is substantially 
M + 1 higher than the other compared methods and requires the 
gI pmputation of (L> max - D min + 1) + (I max - 7 min + 1) pairs 
" iters with costs f a (D,I) and f m (D,I) for additions and 
+/m (TJmin, IiWWjtiplications, respectively, for each pair of filters with D and 
2(Imax — -fmin y+Spec jficaiiy. these costs are shown as a function of D and 



(A, 



+7D 2 H 

1 1 ^ max I 
/m(^mai i 7 ma 



reduced-rank algorithms. For the RLS algorithms, depicted in 
Fig. 2 (b), we verify that the BARC reduced-rank scheme 
is much simpler than any full-rank or reduced-rank RLS 
algorithm. This is because there is a quadratic cost on M 
rather than D for the full-rank schemes operating with the 
RLS algorithm and a high computational cost associated with 
the design of the transformation matrix S n [i] for all reduced- 
rank methods except for the BARC scheme. The AVF scheme 
l20l . I2l1l . Il22ll usually requires extra complexity as it has 
more operations per auxiliary vector (AV) and also requires 
a higher number of AVs to ensure a good performance. The 
trained AVF employs a cross-correlation vector estimated by 
i>[i\=ap[i-l] + {l-a)b* k \i]r\i}. 



(a) Complexity of SG Algorithms 



(b) Complexity of RLS Algorithms 



10" 



z 10' 




Full-rank-trained 

- Full-rank-CCM 
MSWF-trained 

- MSWF-CCM 

- JOINT-CCM 
- BARC-CCM 

- AVF-trained 



100 



200 



300 




300 



Fig. 2. Complexity in terms of arithmetic operations of (a) SG and (b) RLS 
algorithms and AVF-based recursions. 



The computational complexity of the proposed model-order 
selection algorithm (Auto-Rank) and the existing rank selec- 
tion algorithms is shown in Table EH. We can notice that the 
proposed model-order selection algorithm with extended filters 
is significantly less complex than the existing methods based 
on projection with stopping rule [T9| and the CV approach 
Specifically, the proposed rank selection algorithm with 



extended filters only requires 2(D r 



Drain) "F 2(7^ 



/ at the bottom of Table III and we have for the SG version 
f a (D, I) = AD + BD + 41 + (I - 1)M - 2 additions and 
f m (D, I) = AD + B(D + 1) + 57 + IM + 4 multiplications 
(see the last rows of Table I), whereas for the RLS version we 
have f a (D, I) = 6D 2 + {B + 1)D + 6I 2 + 1 + (7 - 1)A7 + 8 
additions and f m (D, I) = 7D 2 + (B+8)D+7I 2 + 7I+IAI+3 
multiplications (see the last rows of Table II). It . Despite the 
cost, its performance is comparable with the proposed model - 
order selection algorithm with extended filters. 

V. Analysis of the Proposed Algorithms 

In this section, we develop a stability analysis of the 
proposed method and SG algorithms and study the conver- 
gence issues of the optimization problem. Specifically, we 
study the existence of multiple solutions and discuss strategies 
for dealing with it. We consider particular instances of the 
proposed algorithms for which a global minimum may be 
encountered by the proposed SG and RLS algorithms. We 
also examine cases for which there is no guarantee that the 
algorithms will converge to the global minimum and may 
end up in local minima. It should be mentioned, however, 
that the proposed SG and RLS algorithms were extensively 
tested for a number of applications and numerous scenarios. 
It was verified in these experiments that the algorithms always 
converge to approximately the same filter values irrespective 
of the initialization. This suggests that the problem may have 
multiple global minima or that every point of minimum is 
a point of global minimum or that the switching of branches 
allows the algorithms to find the global minimum. Specifically, 
we are interested in examining three cases of adaptation and 
parameter estimation, namely: 

• Case i) - Sd[i\ is fixed, i.e. the interpolator v[i] and the 
decimation matrix D[i] are fixed. 

• Case ii) - Sd[i\ is time-variant with D [i] being fixed and 
v[i] being time-variant. 

• Case iii) - Su[i] is time-variant, where D[i] and v[i] are 
both time-variant. 

• Case iv) - So[i] is time-variant, where D[i] is time- 
variant and v[i] is time-invariant. 

For the sake of analysis and the convexity issues of the 
problem, we have opted for studying the method for the 
four cases previously outlined. This allows us to gain further 
insight and draw conclusions on the properties of the different 
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configurations of the method. A key feature of the proposed 
method which makes its convergence study extremely difficult 
is the combined use of discrete and continuous optimization 
techniques. Even though the necessary conditions for the opti- 
mization algorithms are met ll33l . Il34l and the cost functions 
used for deriving the SG and RLS algorithms are continuously 
differentiable, the discrete nature of the decimation and the 
patterns used make its theoretical analysis highly challenging. 
This proof is beyond the scope of this paper and remains a 
very interesting open problem. 

A. Stability Analysis 

In this part, we examine the stability of the proposed SG 
algorithms. In order to establish these conditions, we define 
the error matrices at time i as 



where 
A = 



{I - ii v e[i]B w [i]u[i]D H [i\r H [i}} ^ v e\i]B w [i\u[i]D H [i]r H [i]S D 
{I - Li w e[i]Il{i]SZ\i}r\i}r H {i\S D \i]} -^ w e[i\Tl[i\SE[i\r[i\r H [i\ 



+^e[i\B w [i\u[i\D H [i\r H [{\{S D [i\{I - w 



O p t 



- S 



_D,opt , 



-fi w e[i]TI[i]Sa[i]r[i]r H [i](es D [i](w opt - I) + SD,o P tw opt ) 



Dx(M- 



Es D [i] = Sd[i\ — Sd, opt and 



w\i\ — w 



(37) 



opt? 



where w opt and So,opt are the optimal parameter estimators. 
Since we are dealing with a joint optimization procedure, 
both filters have to be considered jointly. At this point, we 
need to introduce a mathematical manipulation that allows the 
expression of S D [% + 1] = V[i + l]D H [i + l] as a function 
of the recursion in (|20t . We can rewrite Sd\i + 1] as 

M 

S D [i + 1] = V[i + l}D H [i + 1] = B M* + ^}D H [i + 1] 



The previous equations imply that the stability of the algo- 
rithms depends on the spectral radius of A. The parameters 
of w[i] and Soli] will remain bounded and will converge 
asymptotically to the optimal values if the step sizes are 
chosen such the eigenvalues of A H A are less than one. Unlike 
the stability analysis of most adaptive algorithms J6), in the 
proposed approach the terms are more involved and depend 
on each other as evidenced by the equations for A and C. Let 
us now examine the three cases outlined at the beginning of 
this section. 

For case i), the transformation Sd is fixed and we can 
consider only the recursion for the error vector e^,[i], which 
yields 



e*,[i + 1] = (J- iJ w e\i]U\i]S D r\i]r H \i]S D )e ii ,\i] 



fi w e\i]n\i]S%r\i]r H \i]S D w 



(42) 



P t- 



i=i 



M 



M 



= Y^B l v\ l ]D H [ 1 ]-^e[ l ]z*[ 1 ]Y J B l {I-{pg[i 

i=i i=i 

= SdU 



fx v e[i]z* [i]B w [i] u [i] D H [i] , 



(38) 



where the M x I matrix 



ElliBAi 



(p&[i]Pii,[i]) 1 Pa,[i]Pa[i] j > an d tne M x I matrix B t has 
an /-dimensional identity matrix starting at the Z-th row, is 
shifted down by one position for each I and the other elements 
are zeros. 

By substituting the expressions of Es D [i] and [i] in ( 1381 
and d2Tb . respectively, and rearranging the terms we obtain 

E Sd [i + 1] = {I - (j, v e[i]B w [i}u[i]D H [i]r H [i]}E Sz) [i] 

- fx v e \i] B w [%] u \i] D H [i] r H [i] S D [i] e„ [»] 

+ ^ v e[i]B w [i]u[i]D H [i]r H [i]{S D [i]{I - w opt ) 

(39) 

e*,[i + 1] = {I - /x«,e[i]n[i]Sg[i]r[i]r H [i]SB[i]}e^[i] 

- ^ w e\i]n[qS%\i}r\i]r H \i}Es D [i] 

- ^ w e[i]U[i]SD[i]r[i]r H [i](E SD [i](w ^ - I) + 

(40) 

where Tl[i] = I — (p H [i]p[i])~ 1 p[i]p H [i]. Taking expec- 
tations and considering the two error matrices together, we 
obtain 



Taking expectations on both sides, using the fact that 
S[e»o[i]] = and computing R^, = E'fe^ije^i]] we get 

p 4i ] =^^M^is%RSl)R a (I - f, w E[e[i\Tl[i\}S^Rr H [{\S 
^Eile^miSnRSDW^S^RSDEiU 11 ^], 

(43) 

where R = E[r[i]r [i]] is the M X M covariance matrix of 
the input r[i). Using well-known results from the theory in 
0, we have the following stability condition 



< /j w < 



tr 



E[e\i]U\i}] 



S^RSd 



(44) 



E[E SD [i + l]] 



_ E[ eib [i + 1] 







Dx(M-l) 



= A 



E[Es D [i]] 
E[ew[i]\ | Dx(M _!) 

(41) 



For case ii) we assume that D[i] is fixed and v[i] and w[i] are 
time-variant, which means the trajectories of So[i] an d 
must be considered jointly. Therefore, the equation in (|4H 
should be used in the analysis. For stability, the step sizes 
jslj^uld be adjusted such that the eigenvalues of A H A are less 
than offe. bespite this condition of stability the algorithms may 
converge to local minima. In what follows, we will study this. 

For cases iii) and iv), we consider that D[i], v[i] and 
w[i] are time- variant and D[i] and w[i] are time-variant, 
respectively. The condition of stability is different from the 
l&fei'es since D[i] is a discretely optimized parameter 
and v [i] and w [i] are parameter vectors that are continuously 
optimized. The equation in ( HTb still holds but the discrete 
nature of D[i] makes a precise stability analysis impractical 
since D[i] is switched every time instant. In addition, the 
problem becomes very difficult to treat since local minima 
may aYise due to the joint adaptation of D[i], v[i] and w[i) 
(case iii)) and the joint adaptation of D[i] and w[i] (case iv)). 
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B. Analysis of the Optimization Problem 

Let us now consider an analysis of the joint optimization 
method from the point of view of the cost function and 
the constraints. Our strategy is to examine the four cases 
previously outlined and draw conclusions on what happens 
to the nature of the optimization problem. Let us drop the 
time index [i] for simplicity and define the cost function 



Jcm(v,D,w) 



= E 


\\w H S D r\ 2 - 


- 1 


)1 


= E 


(\w H D$l v* 


2 


-I) 2 


= E 


~(\t H Ut\ 2 -l 


f 




= E 


"|z| 4 -2|z| 2 + 


1], 



(45) 



where the (D + I) x 1 parameter vector t = [w T v T ] T con- 
siders together the reduced-rank estimator and the interpolator 

and the (D + I) x (D + I) matrix U = \T q 

contains the samples of the received vector and the decimation 
matrix. 

The received vector in ([T) can be rewritten as r = x + 
r\ + n, where x = Ylk=x AkbkPk an d Pk = C k h k . Since the 
symbols b k , k = 1, . . . , K are i.i.d. complex random variables 
with mean zero and unit variance, b k and n are statistically 
independent, and we have R = R x + R r; + tr 2 I, where R x = 
E[xx H ] and R v = E[r]T] H ]. 

Let us consider a desired user and its corresponding trans- 
formation matrix Sd and reduced-rank estimator w. We can 
express the interference free desired signal as 

q k = A kP H S D w (46) 

and the composite signal as 

q = A[ Pl , pa, p K ] H S D w = AP H S D w, (47) 



. . Ak) is a K x K diagonal matrix with 
Pi ■ ■ ■ Pk] i s & M x K matrix with the 



where A = diag(Ai 
the amplitudes, P = 
effective signatures. 

Now let us make use of the constraint w rSupj = 
w H SoCkhk = v and the relation between Sd, w, the 
channel and the signature Cf S D w = vh k H21, |[T4l . 
We then have for the desired user the equivalent expressions 

q k = A k p^S D w = A k h?C"w = vA k h^h k 

= A k p%VD H w = A k v H U^D H w = A k t H U^t, 

(48) 





_ {DU P ) T _ 
lifted versions of 



where the (D + 1) x (D + 1) matrix U T 

and the M x I Hankel matrix contains s 
the effective signature p k of the desired user. 

At this point, we can exploit the previous expressions and 
substitute them into the cost function in d45l l. Assuming for 
simplicity the absence of noise and ISI, the cost function of 



the desired signal can be expressed as 
Jcu{q) = E[{q H bb H qf] - 2E[(q H bb H q) 



K 



K 



K 



8(^ + E 



QlQl 



4F 2 -4^(g i gD 2 -4^-4^( (Z , 



ill 



i=i 



i=i 



1=2 



(49) 



where F 



. H 



v 2 A 2 k \h k h k \ 2 and b = 
[bx ■ ■ ■ bK\ T is a A" x 1 vector with the transmitted symbols. 

In order to study the properties of the optimization of d49h 
we proceed as follows. We take advantage of the constraint 
w SoP k = v and rewrite ( |49l as 

K 

JcM{q) = 8(F+q H q) 2 -4(F 2 +Y,(qiq:) 2 )-4(F+q H q) + l, 

(50) 

A' H P' H , P' = 



1=2 



where q = [q 2 , . . . , qKf = TS D w, T 
[p 2 . . . p K ] and A 1 = diag(A 2 . . . A K ). 

The previous development allows us to examine the four 
cases outlined at the beginning of the section via the compu- 
tation of the Hessian matrix (0) using © = g|y , 
Specifically, is positive definite if m H ®m > for all 
nonzero m G (j K ^ lxK ^ 1 [32] The computation of is 
given by 

= 16 



(F - 1/4)1 + q H qI + qq H - diag( 



92 1 



QK 



(5 



) 



where the first term depends on F and the selection of some 
key parameters, the second term is positive definite, and 
the third and fourth terms of (|5H are positive semi-definite 
matrices. We will now consider the four cases of interest for 
our analysis. 

For case i), we assume Sd fixed and F yields the condition 
v 2 A 2 k \hl h k \ 2 > 1/4, (52) 

that ensures the convexity of the optimization problem in the 
noiseless case. Since q = TSdw is a linear mapping of Sd 
and w, then Jcm(q) is a convex function of q and implies 
that Jcm(S d,w) = Jcm{TSdw) is a convex function of 
S D w. 

For case ii), we suppose that Sd is time-variant due to the 
interpolator v and we shall consider v and w jointly via the 
parameter vector t. In this case, F yields the condition 



Al\t H U 



H.\2 



t\ 2 > 1/4, 



(53) 



Although the optimization problem depends on the parameters 
v and w which suggests a nonconvex problem, there is the 
possibility of modifying the problem with the condition above. 
As the extrema of the cost function can be considered for 
small o~ 2 a slight perturbation of the noise-free case Hill , the 
cost function is also convex for small a 2 provided the above 
conditions hold. 

For case iii), we assume that D, v and w are time-variant. 
The discrete nature of D and the switching between branches 
are clearly associated with a nonconvex problem for which 
there is no easy or known strategy to enforce convexity. 
Interestingly, the switching does not affect the final values 
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of the parameter vectors v and w which converge to the same 
steady state values regardless of the initialization, provided v 
and Sd are not all-zero quantities. 

For case iv), we consider that v is time-invariant, and 
D and w are time-variant. The discrete nature of D and 
the switching between branches are again associated with a 
nonconvex problem for which there is no simple strategy to 
enforce convexity. An analysis of this problem for cases iii) 
and iv) remains an interesting open problem. 

VI. Simulations 

In this section we evaluate the bit error rate (BER) perfor- 
mance of the proposed BARC scheme and algorithms in a 
DS-CDMA interference suppression application. We consider 
the system model detailed in Section II and model the channel 
as a finite impulse response (FIR) filter represented as the 
L p x 1 channel vector h k [i] = [hk,o[i] ■ ■ ■ h k , Lp -i [i]] T EH 
The system employs random sequences of length N = 32 
and N = 64. All the multipath channels are time-varying 
and are generated according to Clarke's model ||35l , which 
is parameterized by the normalized Doppler frequency /r>T, 
where fn is the Doppler frequency and T is the inverse of the 
symbol rate. We assume L p = 9 as an upper bound, which 
means r[i] has M = N + L p - 1 = 40 when N = 32 and 
M = 12 taps when N = 64, respectively. In this case, the ISI 
corresponds to 3 symbols namely, the current, previous and 
successive symbols. In all simulations, we assume L p = 9 as 
an upper bound, 3-path channels with relative powers given 
by 0, —3 and —6 dB, where in each run the spacing between 
paths is obtained from a discrete uniform random variable 
between 1 and 2 chips and we average the curves over 200 
runs. The system has a power distribution among the users for 
each run that follows a log-normal distribution with standard 
deviation equal to 1.5 dB. The blind algorithms employ the 
CCM criterion, adaptive linear receivers that assume perfect 
synchronization and know the spreading code of the user of 
interest. The number of users K does not affect the complexity 
of a receiver designed for a particular user. We measure the 
BER of the desired user and compare the BARC scheme 
with the full -rank lfT2l . lfT3l . reduced-rank schemes with the 
MSWF method 04]], the AVF scheme with training j20], the 
JIO technique [?] and the SVD-based approach that selects 
the D largest eigenvectors |[T8l to compute the transformation 
matrix So[i] an d the MMSE, which assumes the knowledge 
of the channels and the noise variance. All algorithms have 
their parameters optimized with respect to the BER for each 
scenario and the blind algorithms employ the blind channel 
estimator of ||37l to compute the effective signature p[i]. The 
phase ambiguity derived from the blind channel estimation 
method in ll37l is eliminated in our simulations by using the 
phase of [0] as a reference to remove the ambiguity. 

A. Model-Order Adjustment 

In most estimation algorithms, it is necessary to adjust 
parameters such as order, step size and forgetting factor. In 
the proposed BARC scheme, a key issue is the setting of the 
number of elements or the rank of the estimators v\i] and 



w[i] used. We have conducted experiments in order to obtain 
the most adequate rank for the interpolator v[i], with values 
ranging from 3 to 8 and for the reduced-rank filter w [i] with 
values ranging from 1 to 16. Notice that values beyond that 
range are unnecessary since it does not lead to performance 
improvements. 

N=64, K=24, E /N =15 dB, f T=0.0005, Data record = 500 symbols 




Full-Rank-CCM-RLS 
MSWF-CCM-RLS 
MSWF-RLS 
A Full-Rank-RLS 

1 AVF 

JIO-CCM-RLS 
— a — BARC(I=3,B=12)-RLS 
MMSE 



12 



14 



16 



Rank (D) 



Fig. 3. BER performance against rank (D) for the analyzed schemes using 
RLS algorithms. 



The results in Figs. [3] and |4] for a wide range of scenarios 
indicate that the performance is good for a small range of the 
number of taps in v[i] and w[i]. While the BARC scheme 
is not able to construct an appropriate subspace projection 
with only a few coefficients in v(i) and w(i), there is no 
improvement in the tradeoff between model bias and noise 
variance and the estimation task becomes slower when the 
length of the estimator is greater than 6. Thus, for this reason 
and to keep a low complexity we adopt I = 3 and D = 5 
for the next few experiments since these values yield the best 
performance. 



N=64, K=24 users, SNR=15 dB, Data record =500 



- BARC-SG (D=5, B=12) 

- BARC-RLS (D=5, B=12) 
-MMSE 




3 4 5 

Interpolator Rank (I) 



Fig. 4. BER performance against interpolator rank (I) for the analyzed 
schemes using SG and RLS algorithms fjT = 0.0005. 
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B. Impact of Number of Branches and Decimation Schemes 

In this part, we evaluate the performance of the pro- 
posed BARC scheme and algorithms for different decimation 
schemes, and the impact of the number of branches on the 
performance. 



of decimation branches B on the performance for various 
ranks D with a data support of 1500 symbols and the PS-DEC 
decimation approach. The results, depicted in Fig. [6] indicate 
that the performance of the BARC scheme improves as B is 
increased and approaches the optimal MMSE estimator, which 
assumes that the channels and the noise variance are known. 



-6 — Full-Rank-SG 
■* — MSWF-CCM-SG 

— SVD-based-SG 
^ — JIO-CCM-SG 
A— BARC-UNIF-DEC(D=16)-SG 

— BARC-PS-DEC-SG 
-# — BARC-RAND-DEC-SG 
< — BARC-OPT-DEC-SG 
MMSE 




2000 



500 1000 1500 

Number of received symbols 

Fig. 5. BER performance versus number of received symbols. 



In order to assess the proposed decimation methods, we 
compute the BER performance of the algorithms for the uni- 
form (U-DEC), the random (R-DEC), the prestored (PS-DEC) 
and the optimal (OPT-DEC) schemes. The results, shown 
in Fig. |5j indicate that the BARC scheme with the optimal 
decimation (OPT-DEC) achieves the best performance, fol- 
lowed by the proposed method with prestored decimation (PS- 
DEC), the random decimation system (R-DEC), the uniform 
decimation (U-DEC), the MSWF, the SVD and the full-rank 
approach. Due to its exponential complexity, the optimal dec- 
imation algorithm is not practical and the PS-DEC is the one 
with the best trade-off between performance and complexity. 



N=32, K=1 2, E IN =1 2 dB, 1 T=0.0001 , Data record = 500 symbols 
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- BARC(D=4)-SG 
BARC(D=3)-SG 

- SVD-based-SG 
JIO-CCM-SG 
MSWF-CCM-SG 

- MMSE 



5 10 15 

Decimation Branches (B) 



Fig. 6. BER performance versus number of decimation branches. 



In the next experiment, we evaluate the effect of the number 



C. Performance with Model-Order Selection 

In the next experiments, shown in Figs. [7] and [8] we assess 
the performance of the BARC scheme with the proposed 
model-order selection algorithm and mechanisms to determine 
the minimum number of branches necessary to attain a prede- 
fined performance as described in Section VI. 




500 1000 1500 500 1000 1500 

Nmber of received symbols Number of received symbols 



Fig. 7. BER performance against number of symbols for different 
model-order selection algorithms with (a) SG and (b) RLS recursions. 

The evaluation of the model-order selection algorithms is 
shown in Fig. [7] where we consider the BARC scheme with 
SG and RLS algorithms, B = 16, D m i n = 3, D max = 6, and 
Jmin = 2 and jT max = 6. We compare a configuration of the 
BARC scheme with 1 = 2 and D = 3, a second configuration 
of the BARC with I = 6 and D = 6, the BARC with 
the proposed model-order selection algorithm (Auto-Rank) 
with extended filters, the BARC with the method based on 
the stopping rule of OH and the BARC with the CV-based 
algorithm of [|20). Notice that the BARC with the model- 
order selection algorithm based on multiple filters obtains 
a comparable performance to the Auto-Rank approach (the 
curves overlap and for this reason we do not shot it), however, 
the former is significantly more complex. The results indicate 
that the Auto-Rank allows the BARC scheme to achieve fast 
convergence and excellent steady state performance, which is 
close to the optimal MMSE. The performance of the Auto- 
Rank is slightly better than the stopping rule approach of |fT9l 
and the CV-based technique of ||20| . The proposed Auto-Rank 
algorithm is less complex than the algorithms of 0191 and lEUl 
as it reduces the number of possible ranks to be used by the 
estimators by constraining them in a preselected range and 
does not require the computation of orthogonal projections as 
in ED. 
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In the next experiment, we assess the proposed SNB and 
SNB-S algorithms for automatically selecting the necessary 
number of branches to attain a predefined performance. The 
results are shown in Fig. [8] for an identical scenario to Fig. |7] 
We consider the BARC scheme with SG and RLS algorithms 
and the Auto-Rank algorithm for different values of B, and 
the proposed SNR and SNR-S algorithms. The parameter p 
was set equal to 4% greater than the MMSE and £? max = 16 
for the experiment. The results indicate that the proposed 
branch adaptation techniques allow the BARC scheme to 
achieve a performance comparable to the BARC scheme with 
B = 16. In particular, the proposed SNB algorithm achieves 
this performance with B avg = 7.6, whereas the proposed 
SNB-S technique attains this performance with B avg = 5.4 
due to the use of a priori knowledge of the frequency of branch 
usage. In the following example, we consider the model- 
order selection and SNB-S algorithms for the BARC with the 
same parameters used in the previous experiment and the rank 
adaptation mechanisms proposed in |[T9l for the MSWF and 
in flU for the AVF 



(a) N=64, K=24, E b /N Q =1 5 dB, fjT.0.0001 (b) N=64, K=24, E b /N =1 5 dB, f d ~T=0.0001 




500 1000 1500 500 1000 1500 

Number of received symbols Number of received symbols 



Fig. 8. SINR performance against number of symbols with (a) SG 
and (b) RLS recursions. 



D. Performance with Different Loads and SNR Values 

In the last experiment, we assess the schemes and algorithms 
by computing the BER performance against E^/Nq and the 
number of users, as depicted in Fig. [9] The BER is evaluated 
for data records of 1500 QPSK symbols and a scenario where 
the trained receivers employ pilot signals for estimating their 
parameters with SG and RLS algorithms, whereas the blind 
algorithms operate without any assistance. The maximum 
number of branches for the BARC scheme is -B max = 16 
and we employed the proposed SNR-S algorithm. 

The results show that the BARC scheme with both SG and 
RLS algorithms achieves a BER performance very close to the 
optimal MMSE, that assumes known channels, is followed by 
the AVF, the MSWF-RLS and the full-rank. Specifically, the 
BARC scheme with the SG algorithm can save up to 4 dB 
in E b /N as compared to the AVF and the MSWF-RLS for 



N=64, K=24, f T=0.0001 N =64, K=24, E K /N =1 5 dB f J=0.0001 

u bud 




5 10 15 20 6 12 18 24 30 36 42 48 



E b /N Q (dB) Number of users (K) 

Fig. 9. BER performance versus (a) E^/Nq (b) number of users for a data 
record of 500 symbols. 



the same BER and can accommodate up to 6 more users as 
compared to the AVF and the MWF-RLS for the same BER. 

VII. Conclusions 

This work proposes the BARC scheme and blind adaptive 
algorithms for interference suppression in wireless commu- 
nications systems. The proposed BARC scheme employs a 
reduced-rank decomposition based on the concept of joint in- 
terpolation, switched decimation and reduced-rank estimation 
subject to a set of constraints. The proposed set of constraints 
ensures that the multi-path components of the channel are 
combined prior to dimensionality reduction. We have devel- 
oped low-complexity SG and RLS reduced-rank estimation 
and model-order selection algorithms along with techniques 
for determining the required number of switching branches to 
attain a predefined performance. We have applied the proposed 
algorithms to interference suppression in DS-CDMA systems. 
The results of simulations indicate that the proposed BARC 
scheme allows a substantially better convergence and tracking 
performance than existing reduced-rank and full-rank schemes. 
This is due to the dimensionality reduction carried out by the 
proposed scheme that allows the use of adaptive algorithms 
with very small estimators. The proposed algorithms can 
be applied to other applications including MIMO systems, 
beamforming, broadband channel equalization and navigation 
systems. 
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